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If  one  has  observed  n  independent  and  identically  dis 
tributed  random  variables  X,,  ...,  X  ,  it  is  often  of  in¬ 
terest  to  estimate  their  mean  u  =  ECX^)  and  variance 
o2  =  Var(X^).  If  X-^  has  a  normal  distribution  then 

n  n-1 

X  =  l  X-/n  and  s2  =  £  (X.-X)2/(n-l) 

i= 1  1  i= 1  1 

are,  respectively,  the  usual  unbiased  estimators  of  p  and 
of  o2 ,  and  are  in  many  senses  optimal  estimates.  Also,  it 
is  known  that 


i  .  e  .  -u  )  /  C  s/  /n) 
degrees  of  freedom, 
point  estimate  of  p 
val  on  p : 


X  -  p 


'v  t 


n-1 


s//H" 

has  Student’s-t  distribution  with 
which  allows  one  not  only  to  give 
,  but  also  a  lOO(l-a)9-  confidence 


n-1 

a 

inter 
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X  —  t  i  .a  -  <  p  <  X  +  t  ,  ,  a  -  . 

n_1 ’  1-T  /FT-  —  -  n_1  * 1  ~2  /n~ 

A  problem  with  this  interval  is  that  its  length, 

2t  1  —  > 

11-1,1  7  a r 

is  a  random  variable  and  cannot  be  controlled  to  be  (e.g.) 
<_  2L  by  choice  of  the  sample  size  n.  The  solution  of  this 
problem  given  below  (Section  1)  is  the  base  of  many  solu¬ 
tions  to  problems  of  heteroscedasticity . 

If  X-^  has  a  non-normal  distribution,  then  X  and  s2 
are  still  unbiased  estimators  of  p  and  o2 ,  respectively. 
However,  X  and  s2  are  no  longer  independent  random  vari¬ 
ables  and  confidence  intervals  for  p  (even  with  random 
length)  are  not  generally  available.  Moreover,  X  and  s2 
(while  each  asymptotically  normal  by  the  Central  Limit 
Theorem)  are  no  longer  optimal  estimators  in  this  setting. 
Here  transformations  are  often  used,  i.e.  one  lets 
Yf  =  C(X^),  1  <_  i  <_  n,  for  some  function  £(•)  such  that 
C )  is  normally  distributed.  Then  E^^)  and  Var(Y1>  can 
be  estimated  optimally,  and  confidence  intervals  provided. 
Methods  are  available  (Section  2)  for  relating  these  esti¬ 
mates  and  intervals  back  to  p  and  o2,  the  quantities  of 
primary  interest. 

If  one  has  observations  from  several  sources  ,  say 
X.lt  X,-  ...  (which  are  independent  and  identically 
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distributed)  from  source  i  (i  =  1,  2,  .  ..,  k),  interest 
is  often  in  the  means  y^  =  E(X.q)  and  variances 
o?  =  Var(X^).  Until  recently,  the  procedures  available 
for  these  problems  assumed  normality  and 

0^2  =  ...  =  a^2  =  o2  and  provided  performance  characteris¬ 
tics  (e.g.  power  for  a  test,  confidence  coefficient  or 
length  for  a  confidence  interval,  probability  of  correct 
selection  for  a  ranking-and-selection  procedure)  which 
depended  on  the  unknown  o2.  The  solutions  given  below  for 
these  problems  (Sections  3,  4,  5)  do  not  assume  equal  var¬ 
iances,  and  yet  do  allow  full  control  of  such  performance 
characteristics  as  power,  confidence  interval  length,  and 
probability  of  correct  selection. 

The  procedures  given  require  the  experimenter  to  have 
design-control,  but  generalize  to  any  statistical  problem 
via  the  Heteroscedastic  Method  (Section  6).  Problems  of 
non-normality,  comparison  with  the  usual  variance-stabili- 
zing-transformation  approach,  and  other  comparisons  and 
questions  which  arise  in  practical  implementation,  are 
discussed  throughout. 

1.  Basic  Sampling  Rule  (nQ  ,  w). 

If  we  are  able  to  observe  independent  and  identically 
distributed  normal  random  variables  X^ ,  X2»  with  mean 
y  and  variance  a2  (both  unknown),  and  wish  to  make 


! 
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inference  about  p  ,  let  us  use  the 


Sampling  Rule  ^  Cng  ,  w):  Take  an  initial  sample 
Xi ,  .  .  .  ,  Xn  of  size  nQ  (>_  2),  and  calculate 


X(ng)  =  £  X-/nQ,  s2  =  l  (X.-X  (nQ))2/(n0-l), 

1=1  J  1 = 1  3 


N  =  max  {n0  +  1 ,  [ (ws  ) 2  ]  } 

where  w  >  0  (depends  on  the  problem  under  consideration) 
and  [y]  denotes  the  smallest  integer  ^  y  (e.g.  [5.1]  =  6 
...  introduced  because  sample  sizes  must  be  integers). 
Take  N-  additional  observations  Xn  +1 ,  ...,  X^  and 


calculate : 


X(N-n0)  =  l  X./(N-nn), 
1 =nn+l  3 


X  =  bX(nQ)  +  (l-b)X(N-nQ) 


where 


■> ■  H  -  V-  &  “ 


X  =  l  X./N. 


Then  Stein  showed  in  1945  that,  for  his  sampling  rule 


%  -  y  . 

1/w'  tnQ-l' 


There fore 


X  -  t 


,  a  —  <  u  <  X  +  t  .  ..  a  — 
n n  1  >  1 —  o  w  nn-^  ’  1  “ 2  '**' 


‘0  "  ’ "  2  0 
is  an  exact  100(l-a)%  confidence  interval  for  p  ,  and  its 
half-length  can  be  fully  controlled  to  a  pre-set  number 
L  >  0  by  taking  the  arbitrary  w  >  0  in  &R(nn,  w)  such  that 

t 


n0  1 > 1~2  ^ 


a  —  =  L,  i.e.  w 


'B  O 
'n0_1,1"7 


Note  that,  since  N  is  an  increasing  function  of  w,  the 
total  sample  size  N  required  is  larger  for:  smaller  half- 
length  L;  as  well  as  for  higher  confidence  1-a. 

The  above  procedure  is  valid  for  any  preliminary  sam¬ 
ple  size  n„  >  2.  Since  t  ,  ,  a  decreases  as  nn  in- 
0  —  nQ-l,l-y  u 

creases,  it  is  reasonable  to  make  n^  large  if  possible. 
However,  the  decrease  is  negligible  after  n^  >_  12  or  so, 
hence  it  is  reasonable  to  take  nQ-12  (or  larger  if  one  is 
sure  to,  e.g.,  take  n^  >_  B  for  some  positive  integer 
B  >  12). 

The  validity  of  the  procedure  for  any  (with  no 
"optimal"  choice  of  n^  being  obvious)  bothered  early  work¬ 
ers  in  the  field  and  led  to  dis-use  of  these  early  proce¬ 
dures.  The  above  realization  that  ng  >  12  is  all  that  is 
required  for  good  results  in  practice  as  far  as  is  con¬ 
cerned  is  a  factor  leading  to  great  current  interest  in 
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these  procedures  and  their  extensions  (sections  to  follow 
below).  One  may  think  of  the  situation  as  follows:  one's 
total  sample  size  N  is  approximately  w2 s 1 ,  and  taking  nr 

U 

very  small  will  force  a  large  total  sample  size  simply  be¬ 
cause  of  a  poor  initial  estimate  s2.  E.g.,  see  Table  1. 

2.  Basic  Non-Normality  and  Transformations. 

If  X^  in  Section  1  is  non-normal ,  one  often  uses  such 
transformations  as 

C1(x1)  =  /xpi 
C2<x1)  =  Xj1-'3 

W  =  1°6io(Xl> 

C4(X1)  =  arcsin  /T^- 
C^(Xf)  =  sinh-1  ✓X,  . 

If  one  of  these,  say  £(X^),  is  normally  distributed,  then 
the  mean  and  variance  of  =  £(X^)  may  be  estimated  by 

7  =  l  Y./n,  s2  =  l  (Y.-Y)2/(n-i). 

3=1  3  Y  3=1  3 

However,  interest  in  many  cases  is  not  in  Ey(X. )  and 
Var£(X^),  but  in  the  original  problem  units,  i.e.  EX^  and 
Var(X^).  Simply  using  the  inverse  transformation,  e.g. 
estimate  EX^  by 

Y2  +  a 

in  the  case  of  ,  results  in  a  biased  estimate.  However, 
J.  Neyman  and  E.  L.  Scott  showed  in  1960  that  the  unique 
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minimum  variance  unbiased  estimators  of  EtX^)  are  as  in 
Table  2.  General  results  for  second  order  entire  functions 
were  also  given  by  Neyman  and  Scott. 

In  1968  M.  H.  Hoyle  provided  the  MVUE's  of  Var(X^) 
and,  more  importantly,  of  the  variances  of  the  estimators 
of  EX^  given  in  Table  2;  see  Table  3.  These  latter  can 
be  used  to  obtain  approximate  95%  confidence  intervals  for 


EX^ ,  e.g.  when  using  /X^-a : 

pe72+a+(l-±.)s2  ^V^s2  72  +  s*  {  (1-J)  2-^(l>2  ( 1-|)  2  + 3  ( 1-i)  ** ) }. 


3.  Heteroscedasticity  (Several  Sources):  Tests. 

If  ,  X.^  ,  are  independent  and  identically  dis¬ 

tributed  normal  random  variables  with  mean  and  variance 
ai2  (i  =  1,  2,  k),  then  one  talks  of  homoscedasticity 

if  o^2  =  ...  =  ak2 ,  and  of  heteroscedasticity  otherwise. 
Experimenters  have  often  been  cautioned  that  "the  assump¬ 
tion  of  equal  variability  should  be  investigated"  (e.g.  by 
Cochran  and  Cox  in  1957,  by  Juran,  Gryna,  and  Bingham  in 
1974),  but  no  exact  statistical  procedures  have  been  avail¬ 
able  for  dealing  with  cases  where  one  f^'nds  that  variabili¬ 
ties  are  in  fact  unequal.  (A  variance-stabilizing  trans¬ 
formation  is  commonly  employed,  e.g.  arcsin  for  binomial 
data;  however,  if  is^  normal,  then  £(X— )  will  be  non¬ 
normal:  the  transformation  method  has  not  been  developed 

to  handle  this  problem  except  in  special  cases,  and  even 
there  one  deals  not  with  the  parameters  y, ,  .  .  .  ,  y,  of 
basic  interest  if  one  uses  such  a  transofrmat ion ,  but 
rather  with  some  transform  whose  meaning,  i.e.  interpreta- 
bility,  will  not  often  be  clear.  We  do  nor  therefore  re¬ 
gard  transformations  as  of  general  use  in  cases  of  k  >  2 
when  y^,  .  .  . ,  are  parameters  of  natural  interest  (not 
arbitrary  parametrizations ) . ) 

It  was  first  developed  in  the  1970’s  by  E.  J.  Dudewicz 
that,  applying  sampling  procedure  8B(n0>  w)  of  Section  1 


separately  to  each  source  of  observations,  one  would  obtain 


the  ability  to  fully  control  the  performance  characteris¬ 
tics  of  statistical  procedures  even  in  the  presence  of 

'V 

heteroscedasticity .  Let  X^  denote  the  result  of  applying 
the  sampling  procedure  to  X^  ,  X^2  ,  ...  (i  =  1,  2,  k). 

In  the  case  k  =  1  one  can  develop  procedures  (as  in  Section 
2)  using  X,  but  if  one  replaces  this  by  X  at  the  end  then 
the  procedure  is  still  valid.  In  fact,  it  has  slightly 
better  performance  characteristics  (higher  power)  ,  and  is 
even  simpler  (X  being  simpler  than  X,  the  latter  being  a 
random-weighted  combination  of  the  sample  means  of  the 
first  and  second  stages  of  sampling).  However,  this  im¬ 
provement  is  not  large:  approximately  the  amount  that 

increasing  sample  size  from  N  to  N  +  1  will  buy  (one  ob- 

% 

servation's  worth).  This  improvement  of  X  over  X  has  been 
shown  to  not  hold  in  most  situations  where  k  .>  2  :  in  most 

%  'V 

such  cases  if  Xp  .  .  .  ,  X^  are  used  to  replace  X-^ ,  X^ 

then  the  procedure  no  longer  has  the  desired  performance 
characteristics . 

We  will  describe  the  new  ANOVA  procedures  in  the  con¬ 
text  of  the  one-way  layout;  similar  procedures  are  avail¬ 
able  for  higher-way  layouts.  In  the  one-way  layout,  we 
might  want  to  test  the  null  hypothesis 


Define 
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where 


and  reject  H 


0 


F 


k 

l 

i  =  l 


w 


'V, 


x.  =  J-  y 

k  i=i 


if  and  only  if 
% 


F  >  F 


a 


-  X.)2 


% 

X. 


k  ,n. 


where  f“  is  the  upper  percent  point  of  the  null  dis- 

K,n0  % 

tribution  of  F.  This  null  distribution  is  the  same  as  that 

k  _ 

of  the  random  variable  Q  =  £  (t.-t.)2  where  the  { t - }  are 

i  =  l  1  1 

independent  identically  distributed  Studen:'s-t  variates 

k 

with  n„-l  degrees  of  freedom  and  t.  =  (1/k)  £  t-. 

-v  i=  1 

Values  of  F;.  obtained  by  a  Monte  C  lrlo  sampling 
,n0 

experiment,  along  with  the  power  attained  it  various  alter- 

k  __ 

natives  measured  by  5  ;  7  (p.-u.  )2,  for  v  irious  given 

i  =  l  1 

1/w2  values,  are  presented  in  Bishop  and  D  ’dcricz  (1978). 
There  is  a  need  for  approximations  to  the  percentage  points 

'V 

of  the  F  statistics  under  the  null  and  alternative  distri¬ 
butions.  Such  approximations  are  available  in  the  general 
setting  (see  Dudewicz  and  Bishop  (1979)),  ind  have  been 
studied  as  to  goodness  in  special  cases  ( s  ;e  Bishop, 

Dudewicz,  Juritz,  and  Stephens  (1978)).  Consider  first 

% 

the  distribution  of  F  as  -*  «,  This  limiting  distribu¬ 
tion  is  noncentral  chi-square  with  k-1  degrees  of  freedom 
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and  noncentrality  parameter  A  =  £  w2(p.-g.)2,  denoted  by 

i  =  1 

X^_^(A).  However  numerical  results  indicate  that  for 
small  ng  the  tails  of  this  limiting  distribution  are  too 
light  to  give  a  good  approximation.  One  therefore  approxi¬ 
mates  by  a  ( (n^-l )  /  (ng-3  ) )  •  random  variable  (in 

which  case  F  and  its  approximating  distribution  have  the 
same  expected  value  under  Hq). 

Let  us  illustrate  with  a  numerical  example.  Suppose 
we  wish  to  test  the  hypothesis  that  4  different  chemicals 
are  equivalent  in  their  effects.  Suppose  we  decide  to 
take  initial  samples  of  size  10  with  each  treatment,  want 
only  a  5%  chance  of  rejecting  HQ  if  in  fact  HQ  is  true, 
and  want  an  85%  chance  of  rejecting  HQ  if  the  spread  among 
pj »  43»  4^  is  at  least  4.0  units.  We  then  proceed, 
step  by  step,  as  follows. 

Step  1.  (Problem  specification.)  Here  there  are  k  =  4 
sources  of  observations,  we  desire  an  a  =  .05  level  test  of 
Hq  :  =  ^2  =  43  =  ,  and  if  the  spread  among  p-p  p2,  43» 

P4  is  6  =  4.0  units  or  more  we  desire  power  (probability 
of  then  rejecting  the  false  hypothesis  HQ  )  of  at  least 
P*  =  .85. 

Step  2 .  (Choice  of  procedure.)  Assuming  we  do  not  know 

that  a|  =  o|  =  o3  =  o^,  only  procedure  Sg(ng,  w)  can 

guarantee  the  specifications.  It  requires  we  sample 
observations  in  our  first  stage,  and  recommends  ng 


be  at  least  12  (though  any  nQ  >_  2  will  work).  Suppose  the 
experimenter  only  wants  to  invest  40  units  in  first-stage 
experimentation  and  sets  n^  =  10. 

Step  3.  (First  stage.)  Draw  n^  =  10  independent  observa¬ 
tions  from  each  source,  with  results  as  in  Table  4. 


Table  4.  First  Stage  Samples. 


Chemical  1 

Chemical  2 

Chemical  3 

Chemical  4 

77 . 199 

80 . 522 

79.417 

78.001 

74.466 

79.306 

78.017 

78.358 

82  .  746 

81 . 914 

81.596 

77.544 

76.208 

80. 346 

80.802 

77.364 

82 . 876 

78. 385 

80.626 

77.554 

76.224 

81. 838 

79.011 

75.911 

78 . 061 

82 . 785 

80 . 549 

78.043 

76.391 

80 . 900 

78.479 

78.947 

76.155 

79.185 

81.798 

77.146 

78.045 

80.620 

80.923 

77.386 

Step  4 .  (Analysis  of  first  stage  data.)  We  now  calculate 
the  first  stage  sample  variances  s^,  Sj ,  ,  the  total 

sample  sizes  needed  from  the  four  sources  ,  N3,  N^,  N4 , 
and  the  factors  b^  ,  b^  ,  b3  ,  b^  to  be  used  in  the  second 
stage  analysis.  The  s?'s  are  given  in  Table  5,  along  with 
the  other  quantities.  The  w  needed  is  found  as  follows. 
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We  desire  power  P*  =  .85  (Step  1  above)  when 


A  =  2^1  =  ,  4.0w> 


To  set  w  for  this  power  requirement,  we  first  need  to  know 

"When  do  we  reject?".  We  know  we  will  later  reject  H0  if 
^  %  05 

F  >  where,  approximately, 

^  n  c.  n  «  —  1 

Fi,io  =  iqrj  <7-81>  =  I0-0-- 

The  7.81  is  the  value  a  central  chi-square  random  variable 
with  k-l=4-l=3  degrees  of  freedom  exceeds  with  pro¬ 
bability  a  =  .05  (see  standard  tables,  e.g.,  p.  137  of 
Pearson  and  Hartley  (1970)  or  p.  459  of  Dudewicz  (1976)). 
The  power  will  be,  approximately , 

P[X3  (A)  >  7.81]  =  .85 

if  (see  p.  53  of  the  tables  in  Haynam,  Govindara julu ,  and 
Leone  (1970  )) 

A  =  12 . 301  , 
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Table  5.  Analysis  of  First  Stage. 


ate 

Chemical  1 

Chemical  2 

Chemical  3 

Chemical  4 

»* 

n0 

10 

10 

10 

10 

-•  • 

Sample  Mean 

77 . 837 

80 . 580 

80.122 

77.625 

•  * 

s? 

X 

7 . 9605 

1 .8811 

1.7174 

.  6762 

w 

1.754 

1 .754 

1.754 

1.754 

*  * 

N. 

l 

25 

11 

11 

11 

*  • 

b. 

X 

.  330 

.936 

.  9  39 

.  969 

Step  5.  (Second  stage.)  Draw  -  nQ  observations  from 
source  i  (i  =  1,  2,  3,  4),  yielding  Table  S. 


Table  6.  Second  Stage  Samples. 


i; 

£. 

Chemical  1 

Chemical  2 

Chemical  3 

Chemical  4 

82.549 

79.990 

80.315 

78.037 

\- 

78.970 

78.496 

r 

78.494 

| 

80.971 

r 

80 . 313 

76.556 

•  - 

80.115 

78.659 

77.697 

80.590 

79.647 

**«* 

82.733 

80.552 

i 

79.098 

i 

— - i 
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Step  6.  (Final  calculations.)  We  now  calculate  the  X.  and 
a 

F,  and  find 

a  a  a  % 

Xx  =79.079,  X2  =  80.688,  X3  =  80.197,  X4  =  77.597 

a 

X.  =  79.390  , 
a 

F  *  17. 38  . 


a 

Step  7 .  (Final  Decision.)  Since  F  =  17.33  exceeds 
^05 

F'  =  10.04,  we  reject  the  null  hypothesis  and  decide  the 
chemicals  differ. 


4.  Heteroscedasticity  (Several  Sources):  Confidence 

Intervals . 

The  case  of  a  confidence  interval  with  k  =  1  mean  was 
considered  in  Section  1.  When  k  =  2,  a  two-sided  confidence 
interval  of  half-length  L  >  0  and  with  confidence  coeffi¬ 
cient  1-a  is  given  by 

an.  a  a 

(VX2 )  -  L  <  Pi  -  P2  £  +  L 

if  we  choose  (in  gg  (nQ,  w)) 

c,  a  (nn ) 

1"7  0 

w  =  - 

L 


where  c  is  tabled  in  Table  7 . 


Table  7.  ci_-y 


3  0 

11 

12 

13 

14 

15 

20 

25 

30 

.75 

1.03 

1 .02 

1.02 

1.01 

1.01 

1. 00 

.  99 

.98 

.98 

.80 

1.29 

1.28 

1 .27 

1.26 

1.26 

1.25 

1.  24 

1.23 

1.22 

.85 

1.60 

1.59 

1.57 

1.56 

1.56 

1.  55 

1 .  53 

1 .  51 

1.51 

.90 

2.00 

1.98 

1.96 

1.95 

1.94 

1. 93 

1. 90 

1 . 88 

1.87 

.95 

2.61 

2 .58 

2 . 56 

2.53 

2.52 

2.  50 

2 .45 

2.42 

2.41 

.975 

3.18 

3.13 

3.09 

3.06 

3.04 

3.  02 

2.95 

2  .  91 

2 .88 

.99 

3.89 

3.82 

3.76 

3.71 

3.67 

3.64 

3  .  54 

3.48 

3.45 

.995 

4.41 

4.31 

4.24 

4.18 

4.13 

4.09 

3  .  96 

3.89 

3.85 

.999 

5.61 

5.45 

5.32 

5.22 

5.14 

5.07 

4  .  86 

4 . 74 

4.67 

Note  that  the  corresponding  test  solves  the  Behrens- 
Fisher  problem  exactly  in  two  stages,  with  controlled  level 
and  power. 

For  k  >  2,  multiple-comparison  procedures  are  also 
available  for  many  of  the  usual  multiple-comparison  confi¬ 
dence  interval  goals. 


5.  Heteroscedasticity  (Several  Sources):  Ranking  and 
Selection 

Here  k  >  2  and  we  wish  to  select  (indifference-zone 


formulation)  that  source  with  mean  value  max  (y^,  .  ..,  y^.). 

Let  ijQ-j  <_  .  .  .  <  denote  the  ordered  values  of 

Mf,  ■■■>  Uk;  thus  U[k]  denotes  max  (y^,  ...,  ) ,  etc.. 


The  performance  characteristic  of  interest  is  the  probabi¬ 
lity  that  we  will  make  a  correct  selection  (CS)  ,  i.e.  that 
the  population  actually  selected  is  the  one  which  has  mean 


I 

| 

j 

j 


y[k]‘  following  Bechhofer  (1954),  we  require  that  our 
P(CS)  have  at  least  a  specified  value  P*  (1/k  <  P*  <  1) 
whenever  the  largest  mean  is  at  least  6*  (0  <  6*)  more  than 
the  next-to-largest  mean;  i.e.  we  require 


P(CS)  >  P*  whenever  1  <$*. 

This  problem  was  considered  by  Dudewicz  and  Dalai  (1975). 
The  procedure  is  to  select  that  source  which  yields  the 

'V  % 

largest  of  ,  . . . ,  ;  i.e. 

<V,  'V,  “V 

Select  tt ^  iff  X^  =  max(X^,  ...»  X^). 

In  the  sampling  rule  gg(nQ,  w)  one  chooses 


w 


5 


where  CpA(nQ)  (for  specified  values  of  P*  and  n0>  is  tabled 
in  Table  7  for  k  =  2,  and  is  tabled  in  Dudewicz,  Ramberg, 
and  Chen  (1975)  for  k  >  2.  Approximations  for  k  >  25  are 
given  by  Dudewicz  and  Dalai  (1975).  Subset-selection  pro¬ 
cedures  are  also  available  there. 
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6.  The  Heteroscedasfic  Method. 

The  above  special-case  solutions  have  been  placed  into 
a  general  theory  with  the  Heteroscedast  ic  i'ethod  of  Dudewic 
and  Bishop  (1979).  In  a  general  decision-theoretic  setting 
they  show  how  to  develop  procedures  like  the  above  in  any 
problem.  It  is  also  shown  that  no  single-stage  procedure 
can  solve  most  such  problems. 

Some  questions  one  might  ask  about  the  procedures  thus 
produced  are  as  follows.  First,  how  do  they  perform  under 
violation  of  normality?  Here,  Iglehart  (1977)  has  shown, 
in  some  computational  settings,  that  replacing  s2  by  a  jack 
knife  estimator  is  sufficient  to  preserve  the  main  proper¬ 
ties  of  the  procedures.  Other  recent  work  (Dadewicz  and 
van  der  Meulen  (1980))  shows  asymptotic  validity  under 
asymptotic  normality.  Second,  are  they  preferable  to  com¬ 
parable  sequential  procedures?  Here  it  should  be  noted 
that  in  most  cases  there  are  no  "comparable"  sequential  pro 
cedures:  the  sequential  procedures  of  Chow-Robbins  type 

usually  mentioned  only  have  asymptotic  validity  even  under 
exact  normality,  while  the  »g(ng,  w)-based  two-stage  proce¬ 
dures  have  exact  known  properties.  It  is  sometimes  claimed 
that  the  sequential  procedures  are  more  efficient,  but  this 
is  only  as  (e.g.)  c|  0.  The  so-called  inefficiency  of 
3g(n0,  w)  in  this  situation  is  due  to  the  fact  it  then 
requires  N  =  ng  +  1  (since  N  ng  +  1  always)  and  in  fact  (as 
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ck2  ■*  0)  an  N  ■+  1  will  suffice.  This  appears  to  have 
little  practical  relevance,  as  one  usually  knows  trivial 
sample  sizes  will  be  insufficient  for  one's  problems;  it 
is  rather  a  curiosity  of  mathematical  interest  only. 

As  a  final  note,  we  mention  that  while  variance- 
stabilizing  transformations  and  other  approximate  methods 
have  existed  for  many  years,  most  experimental  situations 
are  such  that  the  problem  is  far  from  solved  by  these 
approximate  methods.  For  example,  such  methods  misalloeate 
sample  size  by  taking  the  same  sample  size  from  a  treatment 
with  relatively  small  variability,  as  from  a  treatment  with 
relatively  large  variability,  even  though  the  need  for  ob¬ 
servations  on  the  latter  is  substantially  greater  and  they 
have  a  greater  beneficial  effect  on  performance  character¬ 
istics  of  the  overall  analysis.  Also,  procedures  based  on 
5g(nQ,  w)  behave  acceptably  even  if  variances  are  equal, 
hence  the  equality  of  variances  tests,  which  are  known  to 
be  weak  in  power,  can  be  skipped  and  these  new  procedures 
applied  directly  without  regard  to  equality  or  inequality 


of  variances. 
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